FPT Algorithms and Kernels for the Directed 
&-Leaf Problem 

Jean Daligault* Gregory GutinJ Eun Jung Kim* and Anders Yeo § 

August 18, 2009 

Abstract 

A subgraph T of a digraph D is an out-branching if T is an oriented spanning 
tree with only one vertex of in-degree zero (called the root). The vertices of T 
of out-degree zero are leaves. In the Directed Max Leaf Problem, we wish to 
find the maximum number of leaves in an out-branching of a given digraph D (or, 
to report that D has no out-branching). In the Directed £-Leaf Problem, we are 
given a digraph D and an integral parameter k, and we are to decide whether D 
has an out-branching with at least k leaves. Recently, Kneis et al. (2008) obtained 
an algorithm for Directed A>Leaf of running time 4 k ■ n 0{l \ We describe a new 
algorithm for Directed A>Leaf of running time 3.72*-n 0<1) . This algorithms leads to 
an 0( 1 .9973" )-time algorithm for solving Directed Max Leaf on a digraph of order 
n. The latter algorithm is the first algorithm of running time 0{y") for Directed 
Max Leaf, where y < 2. In the Rooted Directed £-Leaf Problem, apart from D 
and k, we are given a vertex r of D and we are to decide whether D has an out- 
branching rooted at r with at least k leaves. Very recently, Fernau et al. (2008) 
found an 0(k 3 ) -size kernel for Rooted Directed ^-Leaf. In this paper, we obtain 
an 0(k) kernel for Rooted Directed £-Leaf restricted to acyclic digraphs. 



1 Introduction 

The Maximum Leaf problem is to find a spanning tree with the maximum number of 
leaves in a given undirected graph G. The problem is well studied from both algorithmic 
IfTTl |T~8l [23l [25 1 and graph-theoretical IflOl [19l [20l l22l points of view. The problem has 
been studied from the parameterized complexity perspective as well and several authors 
[T3] Q4] have designed fixed parameter tractable (FPT) algorithms for solving the 

"Universite Montpellier II, LIRMM, 161 rue Ada, 34392 Montpellier Cedex 5 - France, 
daligault@lirmin.fr 

^Department of Computer Science, Royal Holloway, University of London, Egham, Surrey TW20 0EX, 
UK, gutin@cs . rhul .ac.uk 

* Department of Computer Science, Royal Holloway, University of London, Egham, Surrey TW20 0EX, 
UK, eunjung@cs.rhul.ac.uk 

§ Department of Computer Science, Royal Holloway, University of London, Egham, Surrey TW20 0EX, 
UK, anders@cs.rhul.ac.uk 



1 



parameterized version of Maximum Leaf (the £-Leaf problem): given a graph G and an 
integral parameter k, decide whether G has a spanning tree with at least k leaves. 

Maximum Leaf can be extended to digraphs. A subgraph T of a digraph D is an 
out-tree if T is an oriented tree with only one vertex of in-degree zero (called the root). 
The vertices of T of out-degree zero are leaves. If V(T) - V(D), then T is an out- 
branching of D. The Directed Maxfmum Leaf problem is to find an out-branching 
with the maximum number of leaves in an input digraph. The parameterized version 
of the Directed Maximum Leaf problem is Directed £-Leaf: given a digraph D and an 
integral parameter k, decide whether D has an out-branching with at least k leaves. If 
we add a condition that every out-branching in Directed &-Leaf must be rooted at a 
given vertex r, we obtain a variation of Directed £-Leaf called the Rooted Directed 
£-Leaf problem. 

The study of Directed A:-Leaf has only begun recently. Alon et al. JT] [2] proved 
that the problem is FPT for a wide family of digraphs including classes of strongly 
connected and acyclic digraphs. Bonsma and Dorn extended this result to all digraphs 
in JS], and improved the running time of the algorithm in ||2) to 2 kXogk n 0( - l) in |9|. Re- 
cently, Kneis et al. ||2TI obtained an algorithm for solving the problem in time 4- k n 0( - l \ 
Notice that the algorithm of Kneis et al. J21 1 applied to undirected graphs is of smaller 
running time (as a function of k) than all previously known algorithms for A:-Leaf. Yet, 
the algorithm of Kneis et al. 12TI is not fast enough to answer in affirmative the ques- 
tion of Fellows et al. [ 14 1 of whether there exists a parameterized algorithm for Max 
Leaf of running time f(k)n 0( - l \ where /(50) < 10 2() . Very recently, Fernau et al. lfT31 
proved that no polynomial kernel for Directed £-Leaf is possible unless the polyno- 
mial hierarchy collapses to the third level (they applied a recent breakthrough result of 
Bodlaender et al. [6 1). Interestingly, Rooted Directed £-Leaf admits a polynomial size 
kernel and Fernau et al. lfl5l obtained one of size 0(k 3 ). 

The only known approximation algorithm for Directed Max Leaf is due to Drescher 
and Vetta [ 12] and its approximation ratio is 0( V£na»(D)')> where f max (D) is the maxi- 
mum number of leaves in an out-branching of a digraph D. 

In this paper, we obtain an algorithm faster than the one of Kneis et al. ED for 
Directed £-Leaf. Our algorithm runs in time 3.72*n°^. Unfortunately, our algorithm 
cannot solve the above-mentioned question of Fellows et al. [ 14|, but it shows that the 
remaining gap is not wide anymore. We also obtain a linear size kernel for Directed 
£-Leaf restricted to acyclic digraphs. Notice that (i) Directed Max Leaf restricted to 
acyclic digraphs is still NP-hard (3], and (ii) for acyclic digraphs Directed £-Leaf and 
Rooted Directed £-Leaf are equivalent since all out-branchings must be rooted at the 
unique vertex of in-degree zero. 

We recall some basic notions of parameterized complexity here, for a more in-depth 
treatment of the topic we refer the reader to ITTl[T6ll24ll . 

A parameterized problem II can be considered as a set of pairs (/, k) where / is 
the problem instance and k (usually an integer) is the parameter. II is called fixed- 
parameter tractable (FPT) if membership of (/, k) in II can be decided in time 0(f(k)\I\ c ), 
where |/| is the size of /, f(k) is a computable function, and c is a constant independent 
from k and /. Let II be a parameterized problem. A reduction R to a problem kernel 
(or kerne lization) is a many-to-one transformation from (/, k) e II to (/', k') e IT, such 
that (i) (/, k) G II if and only if (/', k') e II, (ii) k' < k and |/'| < g(k) for some function 
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g and (iii) R is computable in time polynomial in |/| and k. In kernelization, an instance 
(I,k) is reduced to another instance (V ,k'), which is called the problem kernel; \I'\ is 
the size of the kernel. 

The set of vertices (arcs) of a digraph D will be denoted by V(D) (A(D)). The 
number of vertices (arcs) of the digraph under consideration will be denoted n (m). 
For a vertex x of a subgraph H of a digraph D, N^(x) and N H (x) denote the sets of 
out-neighbors and in-neighbors of x, respectively. Also, let At(x) = {xy : y e N^(x)}, 
dtf(x) = \Nff(x)\, and d H (x) = \N H (x)\. When H — Z) we will omit the subscripts in the 
notation above. 

Let D be a digraph, T an out-tree and L c V(D). A (T,L)-out-tree of D is an 
out-tree 7" of D such that (1) A(T) c A(7"), (2) L are leaves in 7", (3) T and 7" 
have the same root. A (T, L)-out-branching is a (J, L)-out-tree which is spanning. Let 
imaxiP, T, L) be the maximum number of leaves over all (T, L)-out-branchings of D. 
We set this number to if there is no (T, L)-out-branching. For an out-tree T in a 
digraph D, Leaf(r) denotes the set of leaves in T and Int(T) = V(T) - Leaf(7"), the set 
of internal vertices of T, For any vertex x in a tree T let T x denote the maximal subtree 
of T which has x as its root. 

Throughout this paper we use a triple (D, T, L) to denote a given digraph D, an out- 
tree T of D and a set of vertices L c V(D) - Int(T). We denote by D(T, L) the subgraph 
of D obtained after deleting all arcs out of vertices in L and all arcs not in A(T) which 
go into a vertex in V(T). When T and L are clear from the context we will omit them 
and denote D(T, L) by D. For further terminology and notation on directed graphs, one 
may consult J5). The following simple lemma will be used in the rest of the paper. 

Lemma 1.1. fi5§ A digraph D has an out-branching if and only if D has a single 
strong component without incoming arcs. One can decide whether a digraph has an 
out-branching in time 0(n + m). 

2 Another 4 k n 0( 1 } Time Algorithm 

The algorithm of this section is similar to the algorithm in ll2D . but it differs from the 
algorithm in ETI as follows. We decide in an earlier stage which one of the current 
leaves of T cannot be a leaf in a final (T, L)-out-branching and make them to be internal 
vertices based on Lemma [2731 see step 2 in Algorithm M(D, T, L). This decision works 
as a preprocessing of the given instance and gives us a better chance to come up with 
a (T, L)-out-tree with at least k leaves more quickly. A more important reason for this 
step is the fact that our algorithm is easier than the main algorithm in [21 1 to transform 
into a faster algorithm. 

The following simple result was used in 1 1 2 1 and its proof can be found in |2TI . 

Lemma 2.1. If there is an out-branching rooted at vertex r, whenever we have an out- 
tree rooted at r with at least k leaves we can extend it to an out-branching rooted at r 
with at least k leaves in time 0(m + n). 

Lemma 2.2. Given a triple (D, T, L), we have £ m ax(D, T, L) = ( mllx (D, T, L). 



3 



Proof. If there is no (T, L)-out-branching in D, the subgraph D does not have a (T, L)- 
out-branching either and the equality holds trivially. Hence suppose that T* is a (T, L)- 
out-branching in D with t mdx (D, T, L) leaves. Obviously we have t mdx (D, T, L) > 
Anax(A T, L). Since the vertices of L are leaves in T*, all arcs out of vertices in L 
do not appear in T\ i.e. A(T*) c A(D) \ {A+(x) : x e L). Moreover A(T) c A(T*) 
and thus all arcs not in A(T) which go into a vertex in V(T) do not appear in T* since 
otherwise we have a vertex in V(T) with more than one arc of T* going into it (or, the 
root has an arc going into it). Hence we have A(T*) c A0) and the above equality 
holds. □ 

Lemma 2.3. Given a triple (D, T, L), the following equality holds for each leaf x ofT. 

( mdx (D, T, L) = max{£ max (A T,LU {x}), £ mdx (D, T U A+ (x), L)} 

Proof. If ( mdx {D, T, L) — then the equality trivially holds, so we assume that { mdx (D, T, L) > 
1. Since any (T, L U {x})-out-branching or (T U At (x), L)-out-branching is a (T,L)- 
out-branching as well, the inequality > obviously holds. To show the opposite direc- 
tion, suppose T is an optimal (T, L)-out-branching. If x is a leaf in T , then T is a 
(r, L U {x})-out-branching and € mdx {D, T, L) < ( mdx (D, T,LU {x}). 

Suppose x is not a leaf in T . Delete all arcs entering Nt(x) in T , add At (x) and let 
T" denote the resulting subgraph. Note that d T „(y) = 1 for each vertex y in T" which is 
not the root and A(T") c A(D). In order to show that T" is an out-branching it suffices 
to see that there is no cycle in T" containing x. If there is a cycle C containing x in 
T" and xy e A(C), then C - [xy] forms a directed (y, x)-path in D. However this is a 
contradiction as x e V(T) and y £ V(T) and there is no path from V(D) - V(T) to V(T) 
in D. Hence T" is an out-branching. 

As no vertex in L has any arcs out of it in D we note that L c Leaf(7'"). Furthermore 
we note that A(T) c A(T") as A(r) c A(T') and all arcs we deleted from A(T') go to 
a vertex not in V(T). Therefore T" is a (T, L)-out-branching which has as many leaves 
as T. This shows ( mdx (D, T, L) < ( mdx (D, T U A+,(x), L). □ 

Definition 2.4. Given a fn]p/e (D, T, L) and a vertex x e LeaflT) - L, define T r ^f{x) as 
follows. 

(1) x' '.— x. 

(2) While dt(x') = 1 add At(x') = {x'y} to T anc/ let x' := y. 

(3) AddAt(x')toT. 

Now let r™°'(x) = T x . That is, T™°'(x) contains exactly the arcs added by the 
above process. 

The idea behind this definition is the following: during the algorithm, we will 
decide that a given leaf x of the partial out-tree T built thus far is not a leaf of the 
out-branching we are looking for. Then adding the out-arcs of x to T is correct. To 
make sure that the number of leaves of T has increased even when dy_ v ^(x) = 1, 
we add T"""(x) to T instead of just adding the single out-arc of x, as described in the 
following. 
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Lemma 2.5. Suppose we are given a triple (D, T,L) and a leaf x e LeafiT) — L. If 
ZmnxfD, T,LL> {x}) > 1 then the following holds. 

(i) If\Leaf(T r °f(x))\ > 2 then £ max (D, T, L) = max{^ max (D, T,LU \x}), ( max (D, T U 

T^(x),'L). 

(ii) If\Leaf(T^(x))\ = 1 then £ m , x (D, T, L) = i mnx (D, T,LU {*}). 

Proof. Assume that T is an optimal (T, L)-out-branching and that |Leaf(rj.)| = 1. We 
will now show that £ mdlX (D, T,LU {x}) = |Leaf(7")| = WCA T, L). If x is a leaf 
of T then this is clearly the case, so assume that x is not a leaf of T . Let y be the 
unique out-neighbor of x in T . As {max(D, T,LU {x}) > 1 we note that there exists 
a path P - P0P1P2 ■ ■ -Pri= y) from the root of T to y in D(T,L U {x}). Assume that 
q is chosen such that p q <£ T' x and {p q +\,Pq+2, ■ ■ - ,pA £ V(TQ. Consider the digraph 
D* = D[V(T' X ) U {p q } — {x}] and note that p q can reach all vertices in D* . Therefore 
there exists an out-branching in D*, say T*, with p q as the root. Let T" be the out- 
branching obtained from T by deleting all arcs in T' x and adding all arcs in T* . Note 
that |Leaf(7"')| > |Leaf(7")| as Leaf(r*)U{x) are leaves in T" andLeaf(T£)U{/> 9 } are the 
only leaves in T which may not be leaves in T" (and |Leaf(r^.) U \p q }\ = 2). Therefore 
^max(£>, T,LU {x}) > |Leaf(T')| = tmiJP, T, L). As we always have e mm (D, T, L) > 
(maAD, T,L\J {x}) we get the desired equality. 

This proves part (ii) of the lemma, as if |Leaf(T^°?'(^:))| = 1 then any optimal {T, L)- 
out-branching T', must have |Leaf(r(.)| = 1. 

We therefore consider part (i), where |Leaf(r^°2'(x))| > 2. Let Q denote the set 
of leaves of T r °f{x) and let R - V(T™f(x)) - Q. Note that by the construction of 
r™'(x) the vertices of R can be ordered (x -)r\ ,r-i,..., r,- such that r\r^ . . . , r,- is a path 
in T™*{x). As before let T be an optimal (T, L)-out-branching and note that if any r, 
(1 < j < 1) is a leaf of T then |Leaf(r' c )| = 1 and the above gives us £ mdx (D, T, LUjij) = 
{ max (D, T, L). This proves part (i) in this case, as we always have ( m&x {D, T, L) > 
{ max (D, T U Tq1*(x), L). Therefore no vertex in f/i , r%, ■ ■ ■ , r,-} is a leaf of T and all arcs 
(x =)r\rz, r^r^,, . . ., r,-ir,- belong to T . By Lemma l2~3l we may furthermore assume that 
T contains all the arcs from r, to vertices in Q. Therefore T™?(x) is a subtree of T 
and £ max (D, T, L) = £ max (D, T U T r °f(x), L). This completes the proof of part (i). □ 

The following is an 0(A k n° m ) algorithm. Its complexity can be obtained similarly to 
IF2TI . We restrict ourselves only to proving its correctness. 

For every vertex x e V(D), do MD, {x}, 0). 

If one of the returns of Jl(D, {x}, 0) is "YES" then output "YES". 

Otherwise, output "NO". 

M(D, T, L): 

(1) If £ m - dx (D, T, L) = 0, return "NO". Stop. 

(2) While there is a vertex x e Leaf(r)-L such that £ max (D, T, W{x}) = 0, 

add the arcs At (x) to T. 

n J 
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(3) If \L\ > k, return "YES". Stop. 

If the number of leaves in T is at least k, return "YES". Stop. 
If all leaves in T belong to L, return "NO". Stop. 

(4) Choose a vertex x e LeafiT) - L. 

B x := ${(D, T,LU {x}) and B 2 :="NO". 

If |Leaf(r™' (jc))| > 2 then let B 2 := MD, T U Tg£(x), L). 

Return "YES" if either B x or B 2 is "YES". Otherwise return "NO". 

Remark 2.6. While the first line in step 3 is unnecessary, we keep it since it is needed 
in the next algorithm where L C LeafiT) is not necessarily true, see (4.2) in the next 
algorithm, where po t V(T). 

Theorem 2.7. Algorithm Ji(D, T, L) works correctly. In other words, D has a (T, L)- 
out-branching with at least k leaves if and only if Algorithm Ji{D, T, L) returns "YES ". 

Proof. We begin by showing that a call to Jl{D, T, L) is always made with a proper 
argument (D, T, L), that is, T is an out-tree of D and L n Int(T) = 0. Obviously the 
initial argument (D, {x}, 0) is proper. Suppose (D, T, L) is a proper argument. It is easy 
to see that (D, T,LU {x}) is a proper argument. Let us consider [D, T U T r ^(x), L). By 
Definition |23] we note that T U Tp?(x) is an out-tree in D and since we consider the 
digraph D at each step in Definition ^. 4l we note that no vertex in L is an internal vertex 
of T U T r °£(x). Hence (D, T U T r °f(x), L) is a proper argument. 

Consider the search tree ST that we obtain by running the algorithm Jl(D, T,L). 
First consider the case when S T consists of a single node. If 3\.(D, T, L) returns "NO" 
in step 1, then clearly we do not have a (T, L)-out-branching. Step 2 is valid by Lemma 
12.31 i.e. it does not change the return of J[{D,T,L). So now consider Step 3. As 
Anax(A T,L) > 1 after step 1, and by Lemma [2731 the value of ( max (D, T,L) does not 
change by step 2 we note that £ m!ix (D, T,L) > 1 before we perform step 3. Therefore 
there exists a (T, L)-out-branching in D. If \L\ > k or |Leaf(r)| > k then, by Lemma l2~Tl 
any (T, L)-out-branching in D has at least k leaves and the algorithm returns "YES". If 
Leaf(r) c L then the only [T, L)-out-branching in D is T itself and as |LeafiT)| < k the 
algorithm returns "NO" as it must do. Thus, the theorem holds when S T is just a node. 

Now suppose that S T has at least two nodes and the theorem holds for all succes- 
sors of the root R of ST. By the assumption that R makes further recursive calls, we 
have { m ax(D, T,L)> 1 and there exists a vertex x e Leaf(T) — L. If there is a (T, ^-out- 
branching with at least k leaves, then by Lemma l2~51 there is a (T, LU{x})-out-branching 
with at least k leaves or (T U T^(x), L)-out-branching with at least k leaves. By induc- 
tion hypothesis, one of B\ or B 2 is "YES" and thus Jl(D, T, L) correctly returns "YES". 
Else if £ m - dx (D, T,L) < k, then again by Lemma |2~51 and induction hypothesis both B\ 
and B 2 are "NO". Therefore the theorem holds for the root R of ST, which completes 
the proof. □ 

3 Faster Algorithm 

We now show how the algorithm from the previous section can be made faster by 
adding an extra vertex to the set L in certain circumstances. Recall that Step 2 in the 
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above algorithm !R{D, T, L) and in our new algorithm S(D, T, L) is new compared to 
the algorithm in [21 1. We will also allow L to contain vertices which are not leaves of 
the current out-tree T. The improved algorithm is now described. 

For every vertex x e V(D), do S(D, {x}, 0). 

If one of the returns of S(D, {x}, 0) is "YES" then output "YES". 

Otherwise, output "NO". 

S(D, T, L) : 

(1) If 4iax(A T, L) = 0, return "NO". Stop. 

(2) While there is a vertex x e Leaf(r)-L such that ( m:lx (D, T, W{x}) = 0, 

then add the arcs At (x) to T. 

(3) If |L| > k, return "YES". Stop. 

If the number of leaves in T is at least k, return "YES". Stop. 
If all leaves in T belong to L, return "NO". Stop. 

(4) Choose a vertex x e Leaf(T) - L, color x red and let H x :- D. 

(4.1) Let z be the nearest ancestor of x in T colored red, if it exists. 

(4.2) Let I! := L U {x}. 

If z exists and T, has exactly two leaves x and x' and x' e L 
then: 

Let P = pqP\ ... p r be a path in H z - At(z) such that V(P) - 
V(T ; ) = {/?(,} and p r e Ni(z), and let L' := L U {p , x). 

(4.3) Bi := S(D, T,L') and B 2 :="NO". 

(4.4) If |Leaf(r™f W)l > 2 then let B 2 := S(D, T U r™f (jc), L). 

(4.5) Return "YES" if either By or B 2 is "YES". Otherwise return 
"NO". 

The existence of P in step (4.2) follows from the fact that z was colored red, hence 
adding z to L would not have destroyed all out-branchings. Note that po does not 
necessarily belong to T. 

For the sake of simplifying the proof of Theorem l3.2l below we furthermore assume 
that the above algorithm picks the vertex x in Step 4 in a depth-first manner. That is, 
the vertex x is chosen to be the last vertex added to T such that x e Leaf(T) - L. 

Theorem 3.1. Algorithm S(D, T, L) works correctly. In other words, D has a (T, L)- 
out-branching with at least k leaves if and only if Algorithm S(D, T, L) returns "YES". 

Proof. The only difference between !B(D, T, L) and J[(D, T, L) is that in step (4.2) we 
may add an extra vertex po to L which was not done in Jl{D, T, L). We will now prove 
that this addition does not change the correctness of the algorithm. 

So assume that there is an optimal (T, L)-out-branching T with x e Leaf(T') but 
po i Leaf(r')- We will show that this implies that an optimal solution is found in the 
branch of the search tree where we put z into L. This will complete the proof as if an 
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Figure 1: Real lines represents T arcs; dashed lines represent the reachability of po\ 
dotted lines represent the reachability of wq. 

optimal (T, L)-out-branching T does not contain x as a leaf, by Lemma |231 it is found 
in S(D, T U T'^(x), L) and if it includes both x and po as leaves then it is found in 
B(D, T, U) (in step (4.3)). 

Note that T z - T. as T z had exactly two leaves x and x' and xf e L and we have 
just assumed that x is a leaf of T . Let D* = D[V(T' Z ) U {po} - {z}] and consider the 
following two cases. 

If po can reach all vertices of D* in D* then proceed as follows. Let T* be an out- 
branching in D* with po as the root. Let T" be the out-branching obtained from T by 
deleting all arcs in T' z and adding all arcs in T*. Note that |Leaf(r")| > |Leaf(r')| as 
Leaf(T*) U {z} are leaves in T" and Leaf(T') are the only two leaves in T which may 
not be leaves in T" . Therefore an optimal solution is found when we add z to L. 

So now consider the case when po cannot reach all vertices of D* in D*. This 
means that there is a vertex u e Nj{z) which cannot be reached by po in D*. All such 
unreachable vertices lie on the same branch of T z (the branch not containing p r ). Let 
W = wqvvi W2 ■ ■ ■ wiu be a path from the root of T to u, which does not use any arcs 
out of z (which exists as z was colored red in step (4.1), so adding z to L at this stage 
would not destroy all out-branchings). Assume that a is chosen such that w a t T' and 
{w a +\ , w a +2, ■ ■ ■ , W;, u) C V(T' Z ) (see Figure 1). 

Consider the digraph D" - D[ V(T') U {po, w a ) — {z}] and note that every vertex in 
D" can be reached by either po or w a in D" . Therefore, there exists two vertex disjoint 
out-trees T pa and T Wa rooted at p and w a , respectively, such that V(T po ) U V{T Wa ) = 
V(D") (to see that this claim holds add a new vertex y and two arcs ypo and yw a ). 
Furthermore since po cannot reach u in D* we note that both T po and T Wa must contain 
at least two vertices. Let T" be the out-branching obtained from T by deleting all 
arcs in T' and adding all arcs in T Po and in T Wa . Note that |Leaf(T'")| > |Leaf(T')l as 
Leaf^p,,) U Leaf(r„,J U {z} are leaves in T" and Leaf(T') U \w a } are the only three 
vertices which may be leaves in T but not in T" . Therefore again an optimal solution 
is found when we add z to L. □ 
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Theorem 3.2. Algorithm <B(D, T, L) runs in time 0(3J2 k n° m ). 

Proof. For an out-tree Q, let {(Q) = \Leaf(Q)\. Recall that we have assumed that 
S(D, T, L) picks the vertex x in Step 4 in a depth-first manner. 

Consider the search tree S T that we obtain by running the algorithm S(D, \x), 0). 
That is, the root of S T is the triple (D, \x}, 0). The children of this root is (D, {x}, L') 
when we make a recursive call in step (4.3) and (D, r™°'(x), 0) if we make a recursive 
call in step (4.4). The children of these nodes are again triples corresponding to the 
recursive calls. 

Let g(T,L) be the number of leaves in a subtree R of ST with triple (D,T,L). 
Clearly, g(T, L) = 1 when (D, T, L) is a leaf of S T. For a non-trivial subtree R of ST, 
we will prove, by induction, that g(T,L) < ca k ~ e(T) {) k ~ lu , where a = 1.96, /3 = 1.896 
and c > a 2 /3 2 . Assume that this holds for all smaller non-trivial subtrees. (Note that the 
value of c is chosen in such a way that in the inequalities in the rest of the proof, we 
have upper bounds for g(T*, L*) being at least 1 when (D, T*, L*) is a leaf of S T.) 

Recall that x e Leaf(r) - L was picked in step (4). Now consider the following 
possibilities. 

If \L'\ — \L\ + 2, then the number of leaves of R is at most the following as if a call 
is made to S(D, T U T r °f(x), L) in (4.4) then the number of leaves of T increases by at 
least one: 

g(T, L') + g(T U T r °°l(x), L) < ca k - e( - T) p k -M- 2 + ca k - t(T) - 1 ^ 

So we may assume that \L'\ = \L\ + 1 in (4.3). Now assume that |Leaf(r™£'(x))l + 2 
in (4.4). In this case either no recursive call is made in (4.4) or we increase the number 
of leaves in T by at least two. Therefore the number of leaves of R is at most 

ca k- (( .T )/3 k-\L\-l + ca *-«7>2g*HL| = ca k-((T)0c-\U (1 + 

So we may assume that \L'\ = \L\ + 1 in (4.3) and |Leaf(r™f(x))| = 2 in (4.4). Let 
V = T U T%f(x) and consider the recursive call to S(D, T'',L). If we increase the 
number of leaves in T in step (2) of this recursive call, then the number of leaves of 
the subtree of S T rooted at (D, T' , L) is at most 

ca k-((T>)-ip k -\L\-i + ca k-e(T>)-20c-\u = ca k-ar )l3 k-\L\ (-L + J_) . 

Therefore, as £{T') — £(T) + 1, the number of leaves in R is at most 
g(T, L>) + g{T>, L) < caWnffW-i + c ^t)-^-\l\ + ^) 

So we may assume that we do not increase the number of leaves in step (2) when 
we consider (D, T', L). Let y and y' denote the two leaves of T' x (after possibly adding 
some arcs in step (2)). Consider the recursive call to S(D, T', L U \y}). If we increase 
the number of leaves of 7" in step (2) in this call then the number of leaves in R is at 
most 
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g(T,LU{x}) + g(T',LU{y}) + g(T' U (T')%*(y),L) 

< C a^y-wn +( i + i )+ n 

~ V p v a-j3- a s B 7 a- } 

So we may assume that we do not increase the number of leaves in step (2) when we 
consider (D, T',LU{y}). However in this case we note that \L'\ — \L\ + 2 in this recursive 
call as when we consider y' the conditions of (4.2) are satisfied as, in particular, T x has 
exactly two leaves). So in this last case the number of leaves in R is at most 

g(T,LU{x}) + g(T',LU{y}) + g(T' U (T')'^(y), L) 

< ca k -^(3 k -M (I + ( i + i ) + 4) 

We increase either \L\ or £{T) whenever we consider a child in the search tree and 
no non-leaf in ST has \L\ > k or {(T) > k. Therefore, the number of nodes in S T is 
at most 0(ka k /3 k ) = 0(3 J2 k ). As the amount of work we do in each recursive call is 
polynomial we get the desired time bound. □ 



4 Exponential Algorithm for Directed Maximum Leaf 

Note that Directed Maximum Leaf can be solved in time 0(2 n n° m ) by an exhaustive 
search using Lemma 1X7X1 Our 3.12 k n° m algorithm for Directed £-Leaf yields an im- 
provement for Directed Maximum Leaf, as follows. 

Let a = 0.526. We can solve Directed Maximum Leaf for a digraph D on n vertices 
using the following algorithm ADML: 

Stage 1. Set k :- \an\ For each x e V(D) apply B(D, {x},0) to decide whether D 
contains an out-branching with at least k leaves. If D contains such an out- 
branching, go to Stage 2. Otherwise, using binary search and &{D, {x}, 0), return 
the maximum integer t for which D contains an out-branching with t leaves. 

Stage 2. Set t := \an\. For k = t + \,t + 2, . . ,,n, using Lemma [XT1 decide whether 
D(d>, S ) has an out-branching for any vertex set S of D of cardinality k and if the 
answer is "NO", return k - 1 . 

The correctness of ADML is obvious and we now evaluate its time complexity. Let 
r = \an\ Since 3.72° < 1.996, Stage 1 takes time at most 3.12 r n° m = 0(1.996"). 
Since ,,,, 1 < 1.9973, Stage 2 takes time at most 

a*(l— ay " & 

n ) . n om = I L—\ n om = 0(1.9973"). 

rj \a a {\-ay- a j 

Thus, we obtain the following: 

Theorem 4.1. There is an algorithm to solve Directed Maximum Leaf in time 0( 1 .9973"). 
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5 Linear Kernel for Directed &-Leaf restricted to Acyclic 
Digraphs 

Lemma 11.11 implies that an acyclic digraph D has an out-branching if and only if D 
has a single vertex of in-degree zero. Since it is easy to check that D has a single 
vertex of in-degree zero, in what follows, we assume that the acyclic digraph D under 
consideration has a single vertex s of in-degree zero. 
We start from the following simple lemma. 

Lemma 5.1. In an acyclic digraph H with a single source s, every spanning subgraph 
of H, in which each vertex apart from s has in-degree 1, is an out-branching. 

Let B be an undirected bipartite graph with vertex bipartition (V, V"). A subset 
S of V is called a bidomination set if for each y e V" there is an x e S such that 
xy e E(B). The so-called greedy covering algorithm [4] proceeds as follows: Start 
from the empty bidominating set C. While V" + do the following: choose a vertex 
v of V of maximum degree, add v to C, and delete v from V and the neighbors of v 
from V". 

The following lemma have been obtained independently by several authors, see 
Proposition 10.1.1 in |4|. 

Lemma 5.2. If the minimum degree of a vertex in V" is d, then the greedy covering 
algorithm finds a bidominating set of size at most 1 + ^ (l + In ^jj^) ■ 

Let D be an acyclic digraph with a single source. We use the following reduction 
rules to get rid of some vertices of in-degree 1 . 

(A) If D has an arc a — xy with d + (x) — d~(y) — 1, then contract a. 

(B) If D has an arc a - xy with d + (x) > 2, d~(y) = 1 and x + s, then delete x and 
add arc uv for each u € N~(x) and v e N + (x). 

The reduction rules are of interest due to the following: 

Lemma 5.3. Let D* be the digraph obtained from an acyclic digraph D with a single 
source using Reduction Rules A and B as long as possible. Then D* has a k-out- 
branching if and only if D has one. 

Proof. Let D have an arc a - xy with d + (x) = d~(y) = 1 and let D' be the digraph 
obtained from D by contracting a. Let T be a A:-out-branching of D. Clearly, T contains 
a and let T be an out-branching obtained from T by contracting a. Observe that 7" 
is also a £-out-branching whether y is a leaf of D or not. Similarly, if D' has a ^-out- 
branching, then D has one, too. 

Let D have an arc a - xy with d + (x) > 2, d~(y) = 1 and x + s and let D' be 
obtained from D by applying Rule B. We will prove that D' has a ^-out-branching if 
and only if D has one. Let T be a ^-out-branching in D. Clearly, T contains arc xy and 
x is not a leaf of T. Let U be the subset of N + (x) such that xu e A(T) for each u e U 
and let v be the vertex such that vx e A(T). Then the out-branching T of D' obtained 
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from T by deleting x and adding arcs vu for every u e U has at least k leaves (T' is an 
out-branching of D' by Lemma l5Tt . Similarly, if D' has a fc-out-branching, then D has 
one, too. □ 



Now consider D*. Let B be an undirected bipartite graph, with vertex bipartition 
(V, V"), where V is a copy of V(D*) and V" is a copy of V{D*) - {s}. We have 
E(B) = \u'v" : u' e V, v" e V", uv e A(D*)}. 

Lemma 5.4. Let R be a bidominating set of B. Then D* has an out-branching T such 
that the copies of the leaves ofT in V' form a superset ofV' — R. 

Proof. Consider a subgraph Q of B obtained from B by deleting all edges apart from 
one edge between every vertex in V" and its neighbor in R. By Lemma I5TT1 Q corre- 
sponds to an out-branching T of D* such that the copies of the leaves of T in V form 
a superset of V - R. □ 

Theorem 5.5. If D* has no k-out-branching, then the number n* of vertices in D* is 
less than 6.6{k + 2). 

Proof. Suppose that «* > 6.6(k + 2); we will prove that D* has a ^-out-branching. 
Observe that by Rules A and B, all vertices of D* are of in-degree at least 2 apart from 
s and some of its out-neighbors. Let X denote the set of out-neighbors of s of in-degree 
1 and let X" be the set of copies of X in V". Observe that the vertices of V" - X" of 
B - X" are all of degree at least 2. Thus, by Lemma 15721 B - X" has a bidominating 
set S of size at most y(l + In 2) + 1 . Hence, S U {s} is a bidominating set of B and, by 
Lemma l574l D* has a fo-out-branching with b > n* - y(l + In 2) - 2. It is not difficult 
to see that b > f (1 -ln2)-2 > 0.153«*-2 > k. a 

6 Open Problems 

It would be interesting to see whether Directed £-Leaf admits an algorithm of signif- 
icantly smaller running time, say 0(3 k n° m ). Another interesting and natural question 
is to check whether a linear-size kernel exists for Rooted Directed £-Leaf (for all di- 
graphs). 
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